In-car speech recognition using distributed microphones-adapting to automatically detected driving conditions
نویسندگان
چکیده
In this paper, we describe a multichannel method of noisy speech recognition that can adapt to various in-car noise situations during driving. The method allows us to estimate the log spectrum of speech at a close-talking microphone based on the multiple regression of the log spectra (MRLS) of noisy signals captured by multiple distributed microphones. Through clustering of the spatial noise distributions under various driving conditions, the regression weights for MRLS are effectively adapted to the driving conditions. The experimental evaluation shows an average error rate reduction of 43 % in isolated word recognition under 15 different driving conditions.
منابع مشابه
Optimizing regression for in-car speech recognition using multiple distributed microphones
In this paper, we address issues in improving handsfree speech recognition performance in different car environments using multiple spatially distributed microphones. In previous work, we proposed multiple regression of the log-spectra (MRLS) for estimating the logspectra of speech at a close-talking microphone. In this paper, the idea is extended to nonlinear regressions. Isolated word recogni...
متن کاملCENSREC2: corpus and evaluation environments for in car continuous digit speech recognition
This paper introduces a common database and an evaluation framework for connected digit speech recognition in real driving car environments, CENSREC-2, as an outcome of IPSJ-SIG SLP Noisy Speech Recognition Evaluation Working Group. Speech data of CENSREC-2 was collected using two microphones, a close-talking microphone and a hands-free microphone, under three car speeds and four car conditions...
متن کاملMultiple Regression of Log-spectra Fo
This paper describes a new multichannel method of noisy speech recognition, which estimates the log spectrum of speech at a close-talking microphone based on the multiple regression of the log spectra (MRLS) of noisy signals captured by the distributed microphones. Since the method does not assume the arrangement of sound sources and microphones, it can be applied to in-car speech recognition d...
متن کاملCENSREC-3: An Evaluation Framework for Japanese Speech Recognition in Real Car-Driving Environments
This paper introduces a common database, an evaluation framework, and its baseline recognition results for in-car speech recognition, CENSREC-3, as an outcome of the IPSJ-SIG SLP Noisy Speech Recognition Evaluation Working Group. CENSREC-3, which is a sequel to AURORA-2J, has been designed as the evaluation framework of isolated word recognition in real car-driving environments. Speech data wer...
متن کاملDetection of Local Disturbances and Simultaneously Active Speakers for Distributed Speaker-Dedicated Microphones in Cars
For automotive hands-free and speech recognition applications, distributed microphones are often mounted in the car where each of the speakers has a dedicated microphone close to his position. To provide additional control information for further speech enhancement, it is often advantageous to distinguish between the activity of the different passengers. In this contribution speaker activity is...
متن کامل